Fix pipelined transpose empty rank skips#144
Conversation
980e9ef to
ad5239b
Compare
romerojosh
left a comment
There was a problem hiding this comment.
Hi @fallintoplace, thanks for yet another contribution! I had to use codex to get a grasp on what exactly this PR was attempting to fix as the PR comment is a bit vague and doesn't get to the meat of what these changes are solving. Just some feedback since I know that these PRs are being driven by codex or similar tool.
For future reference, this PR seems to be specifically addressing an out of bounds access that can happen with performance reporting in the pipelined backends due to this line
cuDecomp/include/internal/comm_routines.h
Line 459 in 3a8d786
because the
src_ranks and dst_ranks arrays are resized to 0 when paired like in the code here: cuDecomp/include/internal/transpose.h
Lines 694 to 696 in 3a8d786
The quick return in cudecompAlltoallPipelined is sufficient to address this issue. The other changes should be removed as per the inline comments.
Signed-off-by: Minh Vu <vuhoangminh97@gmail.com>
ad5239b to
e2b51a6
Compare
|
/build |
|
🚀 Build workflow triggered! View run |
|
✅ Build workflow passed! View run |
Summary
Validation
git diff --check